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DETAILED ACTION 



Claim Rejections - 35 USC §102 



1 . The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 
that form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in a patent granted on an application for patent by another filed 
in the United States before the invention thereof by the applicant for patent, or on an 
international application by another who has fulfilled the requirements of paragraphs (1), (2), and 
(4) of section 371 (c) of this title before the invention thereof by the applicant for patent. 

The changes made to 35 U.S.C. 102(e) by the American Inventors 
Protection Act of 1999 (AIPA) do not apply to the examination of this application 
as the application being examined was not (1) filed on or after November 29, 
2000, or (2) voluntarily published under 35 U.S.C. 122(b). Therefore, this 
application is examined under 35 U.S.C. 102(e) prior to the amendment by the 
AIPA (pre-AlPA 35 U.S.C. 102(e)). 

2. Claims 1, 10, and 14 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Kanevsky et al (US Patent No. 5,835,888). 

3. Kanevsky et al teaches a statistical language model for very large 
vocabularies, in which the language model is constructed by splitting words into 
stems, prefixes and endings. 

Regarding claims 1 and 10, at col. 4, lines 14-50, Kanevsky discloses that 
the system comprises textual data in a machine readable form which can be a large 
corpus of text, and a vocabulary, such as a dictionary. The vocabulary is used to 
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create sub-vocabularies of components that comprise stems, endings and prefixes. 
The textual data and the vocabulary components are used to generate language 
model statistics, that comprise various sets of statistics: such as stem statistics 
(trigrams, bigrams, unigrams of a stream of stems that are produced from a stream 
of words by cutting prefixes and endings); stem/ending and/or prefix/stem/ending 
n-grams, which reads on "alphabetized text and corresponding phones." 

Additionally, at col. 4, line 64 continuing to col. 5, line 21 Kanevsky 
discloses that for a given stem S, the decoder produces a list of endings E1, E2, 
E3, . . . , which can be used as the next "word" and that the table of stems and 
allowed endings contains lists of all endings that can follow a given stem, which 
reads on "overlapping characters with previous entry are prefix delta encoded" 

Regarding claim 14, Kanevsky discloses that the invention relates to speech 
recognition systems for inflected languages, at col. 1, lines 5-6, which read on "a 
speech recognizer including an input means for receiving input speech." At col. 4, 
lines 15-16, Kanevsky discloses that the system comprises textual data in a 
machine-readable form, which reads on "a processor". At col. 4, lines 28-29, 
Kanevsky disclose textual data and the vocabulary components are used to 
generate language model statistics, which reads on "speech recognition models." 
"said processor responsive to said models and said speech input for providing 
recognition scores dependent on closet match and selecting closet match", is 
inherent from the speech recognition system. 
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At col. 4, lines 14-50, Kanevsky discloses that the system comprises textual 
data in a machine readable form which can be a large corpus of text, and a 
vocabulary, such as a dictionary. The vocabulary is used to create sub- 
vocabularies of components that comprise stems, endings and prefixes. The 
textual data and the vocabulary components are used to generate language model 
statistics, that comprise various sets of statistics: such as stem statistics (trigrams, 
bigrams, unigrams of a stream of stems that are produced from a stream of words 
by cutting prefixes and endings); stem/ending and/or prefix/stem/ending n-grams, 
which reads on "models generated from pronunciation dictionary, said 
pronunciation dictionary comprising alphabetized text and corresponding phones." 

Additionally, at col. 4, line 64 continuing to col. 5, line 21 Kanevsky 
discloses that for a given stem S, the decoder produces a list of endings E1, E2, 
E3, . . . , which can be used as the next "word" and that the table of stems and 
allowed endings contains lists of all endings that can follow a given stem, which 
reads on "overlapping characters with previous entry are prefix delta encoded" 

Claim Rejections - 35 (JSC §103 

4. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as 
set forth in section 102 of this title, if the differences between the subject matter sought to be 
patented and the prior art are such that the subject matter as a whole would have been obvious 
at the time the invention was made to a person having ordinary skill in the art to which said 
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subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 

5. Claims 2-4, 9, and 11-13 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Kanevsky in view of Kuhn et al (US Patent No. 6,230,131), 
hereinafter referred to as Kuhn, and Das (US Patent No. 6,148,283). 

6. Regarding claims 2 and 1 1, Kanevsky teach everything as claimed in claims 

1 and 10. Kanevsky do not specifically teach that the dictionary includes a rule set 
to convert text to phones for text not in the dictionary. However, text to phoneme 
conversion is well known in the art. 

In a similar field of endeavor, Kuhn discloses a method for generating 
spelling-to pronunciation decision tree, which decision trees are used to store a 
series of yes-no questions (which reads on the "rule set") that can be used to 
convert spelled word letter sequences into pronunciations (abstract), for use in a 
speech recognition system to allow the user to add additional words to the 
recognition dictionary (col. 1, lines 49-55), which reads on "convert text to phones 
for text not in the dictionary." Kuhn teaches that implementation of the method 
allows the user to add words to the recognition dictionary without having to 
understand the nuances of building a phonetic transcription (col. 1, lines 49-55). 

Therefore, it would have been obvious to one of ordinary skill at the time of 
the invention to modify the system of Kanevsky to implement a method for 
converting spelled word letter sequences into pronunciations, for use in a speech 
recognition system to allow the user to add additional words to the recognition 
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dictionary, as taught by Kuhn, for the purpose of allowing the user to add words to 
the recognition dictionary without having to understand the nuances of building a 
phonetic transcription, as suggested by Kuhn. 

Kanevsky do not specifically teach error encoding of the entries different 
from the rule set. However, utilizing an error signal in a coding scheme is well 
known in the art. 

In a similar field of endeavor, Das teaches a system for vector quantization, 
which utilizes error vectors in the coding scheme. At col. 7, lines 1-4, Das teaches 
that the difference between a target vector and an input vector forms an error 
vector, which represents the distortion between an input and output of the coding 
system. 

Therefore, it would have been obvious to one of ordinary skill at the time of 
the invention to modify the system of Kanevsky to utilize an error signal in a coding 
scheme, as taught by Das, for the purpose of providing an efficient means for 
implementing a coding scheme. 

Regarding claims 3 and 12, at col. 4, line 64 continuing to col. 5, line 21 
Kanevsky discloses that for a given stem S, the decoder produces a list of endings 
E1, E2, E3, . . . , which can be used as the next "word" and that the table of 
stems and allowed endings contains lists of all endings that can follow a given 
stem, which reads on "encoded set is prefix data encoded." 
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Regarding claims 4 and 13, at col. 3, lines 44-45, Kanevsky discloses stems 
will always be followed by the symbol "<" and endings preceded by the symbol 
">", which reads on "delimiter character between each entry." 

Regarding claim 9, Kanevsky discloses that the invention relates to speech 
recognition systems for inflected languages, at col. 1, lines 5-6, which read on "a 
speech recognizer," At col. 4, lines 15-16, Kanevsky discloses that the system 
comprises textual data in a machine-readable form, which reads on "a processor". 

At col. 4, lines 14-50, Kanevsky discloses that the system comprises textual 
data in a machine readable form which can be a large corpus of text, and a 
vocabulary, such as a dictionary. The vocabulary is used to create sub- 
vocabularies of components that comprise stems, endings and prefixes. The 
textual data and the vocabulary components are used to generate language model 
statistics, that comprise various sets of statistics: such as stem statistics (trigrams, 
bigrams, unigrams of a stream of stems that are produced from a stream of words 
by cutting prefixes and endings); stem/ending and/or prefix/stem/ending n-grams, 
which reads on "pronunciation dictionary comprising alphabetized text and 
corresponding phones." 

Additionally, at col. 4, line 64 continuing to col. 5, line 21 Kanevsky 
discloses that for a given stem S, the decoder produces a list of endings E1, E2, 
E3, . . . , which can be used as the next "word" and that the table of stems and 
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allowed endings contains lists of all endings that can follow a given stem, which 
reads on "overlapping characters with previous entry are prefix delta encoded" 

Kanevsky do not specifically teach that the dictionary includes a rule set to 
convert text to phones for text not in the dictionary. However, text to phoneme 
conversion is well known in the art. 

In a similar field of endeavor, Kuhn discloses a method for generating to-to 
pronunciation decision tree, which decision trees are used to store a series of yes- 
no questions (which reads on the "rule set") that can be used to convert spelled 
word letter sequences into pronunciations (abstract), for use in a speech 
recognition system to allow the user to add additional words to the recognition 
dictionary (col. 1, lines 49-55), which reads on "convert text to phones for text not 
in the dictionary." Kuhn teaches that implementation of the method allows the 
user to add words to the recognition dictionary without having to understand the 
nuances of building a phonetic transcription (col. 1, lines 49-55). 

Therefore, it would have been obvious to one of ordinary skill at the time of 
the invention to modify the system of Kanevsky to implement a method for 
converting spelled word letter sequences into pronunciations, for use in a speech 
recognition system to allow the user to add additional words to the recognition 
dictionary, as taught by Kuhn, for the purpose of allowing the user to add words to 
the recognition dictionary without having to understand the nuances of building a 
phonetic transcription, as suggested by Kuhn. 
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Kanevsky do not specifically teach error encoding of the entries different 
from the rule set. However, utilizing an error signal in a coding scheme is well 
known in the art. 

In a similar field of endeavor, Das teaches a system for vector quantization, 
which utilizes error vectors in the coding scheme. At col. 7, lines 1-4, Das teaches 
that the difference between a target vector and an input vector forms an error 
vector, which represents the distortion between an input and output of the coding 
system. 

Therefore, it would have been obvious to one of ordinary skill at the time of 
the invention to modify the system of Kanevsky to utilize an error signal in a 
coding scheme, as taught by Das, for the purpose of providing an efficient means 
for implementing a coding scheme. 

7. Claim 5 is rejected under 35 U.S.C. 103(a) as being unpatentable over Kuhn 
(US Patent No. 6,230,131) in view of Das (US Patent No. 6,148,283). 

8. Kuhn discloses a method for generating to-to pronunciation decision tree, 
which decision trees are used to store a series of yes-no questions (which reads on 
the "rule set") that can be used to convert spelled word letter sequences into 
pronunciations (abstract), for use in a speech recognition system to allow the user 
to add additional words to the recognition dictionary (col. 1, lines 49-55), which 
reads on "convert text to phones for text not in the dictionary." 
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Kuhn does not specifically teach error encoding of the entries different from 
the rule set. However, utilizing an error signal in a coding scheme is well known in 
the art. 

In a similar field of endeavor, Das teaches a system for vector quantization, 
which utilizes error vectors in the coding scheme. At col. 7, lines 1-4, Das teaches 
that the difference between a target vector and an input vector forms an error 
vector, which represents the distortion between an input and output of the coding 
system. 

Therefore, it would have been obvious to one of ordinary skill at the time of 
the invention to modify the system of Kuhn to utilize an error signal in a coding 
scheme, as taught by Das, for the purpose of providing an efficient means for 
implementing a coding scheme. 

9. Claims 6-8 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Kuhn et al (US Patent No. 6,230,131) in view of Das (US Patent No. 6,148,283) in 
further view of Kanevsky (US Patent No. 5,835,888). 

10. Regarding claim 6, Kuhn and Das teach everything as claimed in claim 5. 
Kuhn and Das do not specifically teach prefix delta encoding. 

In a similar field of endeavor, Kanevsky et al teaches a statistical language model 
for very large vocabularies, in which the language model is constructed by splitting 
words into stems, prefixes and endings. Specifically, at col. 4, line 64 continuing 
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to col. 5, line 21 Kanevsky discloses that for a given stem S, the decoder produces 
a list of endings E1, E2, E3, . . . , which can be used as the next "word" and that 
the table of stems and allowed endings contains lists of all endings that can follow 
a given stem, which reads on "prefix delta encoded". 

Therefore, it would have been obvious to one of ordinary skill at the time of 
the invention to modify the system of Kuhn to implement prefix encoding as taught 
by Kanevsky, for implementation in speech recognition systems for inflected 
languages, which have very large vocabularies. 

Regarding claim 7, Kuhn and Das do not specifically teach a delimiter 
character between each entry. However, implementation of delimiters within 
portions of text or data is well known in the art. 

In a similar field of endeavor, Kanevsky et al teaches a statistical language 
model for very large vocabularies, in which the language model is constructed by 
splitting words into stems, prefixes and endings. Specifically, at col. 3, lines 44- 
45, Kanevsky discloses stems will always be followed by the symbol "<" and 
endings preceded by the symbol ">", which reads on "delimiter character between 
each entry." 

Therefore, it would have been obvious to one of ordinary skill at the time of 
the invention to modify the system of Kuhn to implement delimiters between the 
data as taught by Kanevsky, for implementation in speech recognition systems for 
inflected languages which have very large vocabularies. 
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Regarding claim 8, Kuhn discloses decision trees are used to store a series of 
yes-no questions that can be used to convert spelled word letter sequences into 
pronunciations (abstract), for use in a speech recognition system to allow the user 
to add additional words to the recognition dictionary (col. 1, lines 49-55), which 
reads on "alphabetized text and corresponding phones." 

Kuhn and Das do not specifically teach "overlapping characters with 
previous entry are prefix delta encoded". 

At col. 4, line 64 continuing to col. 5, line 21 Kanevsky discloses that for a 
given stem S, the decoder produces a list of endings E1, E2, E3, . . . , which can 
be used as the next "word" and that the table of stems and allowed endings 
contains lists of all endings that can follow a given stem, which reads on 
"overlapping characters with previous entry are prefix delta encoded" 

Therefore, it would have been obvious to one of ordinary skill at the time of 
the invention to modify the system of Kuhn to implement prefix encoding as 
taught by Kanevsky, for implementation in speech recognition systems for 
inflected languages, which have very large vocabularies. 
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Response to Arguments 



1 1 . Applicant's arguments filed August 22, 2002 have been fully considered but 
they are not persuasive. 

Regarding claim 1, applicant argues that Kanevsky splits words into 
prefix/stem/ending and then builds the language model on sub-vocabulary instead 
of on the regular vocabulary and that applicant's invention does not depend on the 
creation of a smaller sub-vocabulary and applicant's prefix does not have to have 
any pronunciation significance or morphological significance. Applicant's 
arguments fail to comply with 37 CFR 1 .1 1 1 (b) because they amount to a general 
allegation that the claims define a patentable invention without specifically pointing 
out how the language of the claims patentably distinguishes them from the 
references. Although applicant argues the invention differs from Kanevsky, the 
Examiner contends that whether for speech synthesis or speech recognition, a 
word model involves pronunciations of words and thus, the disclosure of Kanevsky 
reads on the claimed limitations as indicated in the First Office Action and 
reiterated in the rejection above. 

Applicant further argues that a pronunciation dictionary is not probabilistic 
and that for each entry in the dictionary, there is one and only one sequence of 
phones to describe its pronunciation. In response, the Examiner argues that there 
is no need for a single pronunciation of a word, because the teachings of Kanevsky 
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allows for choosing one pronunciation from several to compensate for varying 
dialects. 

Regarding claims 2 and 5, applicant argues that Kanevsky or the references 
of Kuhn and Das does not teach combining a rule set and a dictionary. In response 
to applicant's arguments against the references individually, one cannot show 
nonobviousness by attacking references individually where the rejections are based 
on combinations of references. See In re Keller, 642 F.2d 41 3, 208 USPQ 871 
(CCPA 1981); In re Merck & Co., 800 F.2d 1091, 231 USPQ 375 (Fed. Cir. 1986). 
Applicant further argues that it is not seen where applicant's claim 2 is obvious in 
view of the references. In response, the Examiner argues that Kuhn discloses a 
method for generating spelling-to pronunciation decision tree, which decision trees 
are used to store a series of yes-no questions (which reads on the "rule set") that 
can be used to convert spelled word letter sequences into pronunciations 
(abstract), for use in a speech recognition system to allow the user to add 
additional words to the recognition dictionary which reads on "convert text to 
phones for text not in the dictionary." Further, Das teaches a system for vector 
quantization, which utilizes error vectors in the coding scheme and teaches that the 
difference between a target vector and an input vector forms an error vector, 
which represents the distortion between an input and output of the coding system, 
and encodes differences in pronunciations however obtained. 
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Thus, the combination of Kanevsky, Kuhn and Das would provide for a 
vocabulary or dictionary comprising alphabetized text and corresponding phones (as 
taught by Kanevsky); and overlapping characters with previous entry are prefix 
delta encoded (as taught by Kanevsky); further allowing for generating spelling to 
pronunciation means that is used to convert spelled word letter sequences into 
pronunciations (as provided by Kuhn) such that entries different from the rule set 
are implemented via a form of error encoding (as provided by Das). 
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Conclusion 



12. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension 
of time policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire 
THREE MONTHS from the mailing date of this action. In the event a first reply is 
filed within TWO MONTHS of the mailing date of this final action and the advisory 
action is not mailed until after the end of the THREE-MONTH shortened statutory 
period, then the shortened statutory period will expire on the date the advisory 
action is mailed, and any extension fee pursuant to 37 CFR 1 .1 36(a) will be 
calculated from the mailing date of the advisory action. In no event, however, will 
the statutory period for reply expire later than SIX MONTHS from the mailing date 
of this final action. 



Application/Control Number: 09/590,613 



Page 17 



Art Unit: 2654 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Angela A. Armstrong whose telephone number is 
703-308-6258. The examiner can normally be reached on Monday-Thursday 7:30- 
5:00 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Marsha Banks-Harold can be reached on (703) 305-4379. 
The fax phone numbers for the organization where this application or proceeding is 
assigned are 703-872-9314 for regular communications and 703-872-9314 for 
After Final communications. 

Any inquiry of a general nature or relating to the status of this application or 
proceeding should be directed to the receptionist whose telephone number is 703- 
306-0377. 



Angela A. Armstrong 

Examiner 

Art Unit 2654 




AAA 

December 9, 2002 



PRIMARY EXAMINER 



